Joint Recognition and Segmentation of Actions via Probabilistic Integration of Spatio-Temporal Fisher Vectors

نویسندگان

  • Johanna Carvajal
  • Chris McCool
  • Brian C. Lovell
  • Conrad Sanderson
چکیده

We propose a hierarchical approach to multi-action recognition that performs joint classification and segmentation. A given video (containing several consecutive actions) is processed via a sequence of overlapping temporal windows. Each frame in a temporal window is represented through selective lowlevel spatio-temporal features which efficiently capture relevant local dynamics. Features from each window are represented as a Fisher vector, which captures first and second order statistics. Instead of directly classifying each Fisher vector, it is converted into a vector of class probabilities. The final classification decision for each frame is then obtained by integrating the class probabilities at the frame level, which exploits the overlapping of the temporal windows. Experiments were performed on two datasets: s-KTH (a stitched version of the KTH dataset to simulate multi-actions), and the challenging CMU-MMAC dataset. On s-KTH, the proposed approach achieves an accuracy of 85.0%, significantly outperforming two recent approaches based on GMMs and HMMs which obtained 78.3% and 71.2%, respectively. On CMU-MMAC, the proposed approach achieves an accuracy of 40.9%, outperforming the GMM and HMM approaches which obtained 33.7% and 38.4%, respectively. Furthermore, the proposed system is on average 40 times faster than the GMM based approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Probabilistic Framework for Recognizing Similar Actions using Spatio-Temporal Features

One of the challenges found in recent methods for action recognition has been to classify ambiguous actions successfully . In the case of methods that use spatio-temporal features this phenomenon is observed when two actions generate similar feature types. Ideally, a probabilistic classification method would be based on a model of the full joint distribution of features, but this is computation...

متن کامل

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

Segmental Spatio-Temporal CNNs for Fine-grained Action Segmentation and Classification

Joint segmentation and classification of fine-grained actions is important for applications in human-robot interaction, video surveillance, and human skill evaluation. However, despite substantial recent progress in large scale action classification, the performance of state-ofthe-art fine-grained action recognition approaches remains low. In this paper, we propose a new spatio-temporal CNN mod...

متن کامل

Qualitative Recognition of Ongoing Human Action Sequences

This paper deals with the qualitative visual recognition of continuous human action sequences. Based on an analysis of the structure of physical actions, a framework for temporal segmentation and qualitative classi cation of physical actions is proposed. In order to achieve correctness as well as e ciency in real time action recognition, a hierarchical spatio-temporal attention control method i...

متن کامل

Spatio-temporal CNN Algorithm for Object Segmentation and Object Recognition

In this paper a spatio-temporal analogic CNN algorithm is designed for front-end filtering, segmentation and object recognition. First, a generalized segmentation strategy is presented based on various diffusion models. Both PDE and non-PDE related schemes are discussed and their VLSI complexity is analyzed. In classification (object recognition) a CNN implementation of the autowave metric, a “...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016